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DETAILED ACTION 



Claim Rejections - 35 USC § 102 
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

1. Claims 1-3, 5, 18-19, 24-25 and 30 are rejected under 35 U.S.C. 102(b) as being 

anticipated by Kosaka et al (US Patent No. 5,220,629), hereinafter referred to as Kosaka. 
Kosaka discloses a speech synthesis apparatus and method. 

Regarding claim 1, Kosaka discloses determining a number of morae representing a 
syllable count in the sentence or word at col. 8, lines 42-45, which reads on "counting syllables 
in each word of said text segment". Additionally, at col. 12, lines 55-64 Kosaka discloses a 
parameter expansion/reduction rate setting means and col. 13, lines 1-10 discloses that 
parameters are expanded/reduced, and connected to coincide with a syllable beat point pitch set 
by a syllable beat point pitch setting means, which reads on "assigning a playing rate indicator to 
said each word of said text segment based on a total number of syllables in said word." 

Regarding claim 2, Kosaka discloses everything as claimed in claim 1 . Additionally, 
Kosaka discloses expansion/reduction of speech segments for speech synthesis at col. 8, line 63 
continuing to col. 10, line 62. 
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Regarding claim 3, Kosaka discloses everything as claimed in claim 2. Additionally, 
Kosaka discloses that the period lengths of speech segments are changed in accordance with a 
change in morea (col. 9, lines 39-40). 

Regarding claim 5, Kosaka teaches changing the utterance speed of the based on 
threshold criteria at col. 9, line 34 - col. 10, line 16. 

Regarding claim 18, Kosaka discloses implementation of the program in a central 
processing unit at col. 6, lines 50-51, which reads on "processor" and "persistent storage memory 
in communication with said processor, storing processor readable instructions". Additionally, 
Kosaka discloses text input means at col. 5, line 64. Kosaka discloses determining a number of 
morae representing a syllable count in the sentence or word at col. 8, lines 42-45, which reads on 
"counting syllables in each word of said text segment". Additionally, at col. 12, lines 55-64 
Kosaka discloses a parameter expansion/reduction rate setting means and col, 13, lines 1-10 
discloses that parameters are expanded/reduced, and connected to coincide with a syllable beat 
point pitch set by a syllable beat point pitch setting means, which reads on "assigning a playing 
rate indicator to said each word of said text segment based on a total number of syllables in said 
word." • 

Regarding claim 19, Kosaka discloses everything as claimed in claim 18. Additionally, 
Kosaka discloses expansion/reduction of speech segments for speech synthesis at col. 8, line 63 
continuing to col. 10, line 62. 

Regarding claim 30, Kosaka discloses everything as claimed in claim 18. Additionally, 
Kosaka discloses that the period lengths of speech segments are changed in accordance with a 
change in morea (col. 9, lines 39-40). 
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Regarding claim 24, Kosaka discloses implementation of the program in a central 
processing unit at col. 6, lines 50-51 , which reads on "a computer readable medium storing 
computer software". Additionally, Kosaka discloses text input means at col. 5, line 64. Kosaka 
discloses determining a number of morae representing a syllable count in the sentence or word at 
col. 8, lines 42-45, which reads on "counting syllables in each word of said text segment". 
Additionally, at col. 12, lines 55-64 Kosaka discloses a parameter expansion/reduction rate 
setting means and col. 13, lines 1-10 discloses that parameters are expanded/reduced, and 
connected to coincide with a syllable beat point pitch set by a syllable beat point pitch setting 
means, which reads on "assigning a playing rate indicator to said each word of said text segment 
based on a total number of syllables in said word." 

Regarding claim 25, Kosaka discloses everything as claimed in claim 24. Additionally, 
Kosaka discloses expansion/reduction of speech segments for speech synthesis at col. 8, line 63 
continuing to col. 10, line 62. 

Claim Rejections - 35 (JSC §103 
The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all obviousness 
rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

2. Claim 4 is rejected under 35 U.S.C. 103(a) as being unpatentable over Kosaka in view of 

Hutchins (US Patent No. 5,384,893). 
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Regarding claim 4, Kosaka teaches everything as claimed in claim 3. Kosaka does not 
specifically teach decreasing the duration of pauses associated with selected punctuation in a text 
segment. However, it is well known to modify output duration of text-to-speech synthetic 
speech to reflect various punctuation in the text of interest. 

In a similar field of endeavor, Hutchins discloses a method and apparatus for speech 
synthesis based on prosodic analysis. Specifically, at col. 7, lines 44-56, Hutchins teaches 
implementation of a prosody generator, which "adjusts duration before and stress following 
punctuation." 

Therefore, it would have been obvious to one of ordinary skill at the time of invention to 
modify the speech synthesis system of Kosaka to implement modifying the duration of pauses 
associated with punctuation in a text segment, as taught by Hutchins, for the purpose of making 
the synthetic speech sound more like conversational speech and hence sound more natural. 

3. Claims 6-7, 20-21 and 26-27 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Oikawa et al (US Patent No. 5,396,577), hereinafter referred to as Oikawa, in view 
of Church (US Patent No. 5,146,405). 

Oikawa discloses a speech synthesis apparatus for rapid speed reading. 

Regarding claims 6, 20, and 260ikawa et al teaches receiving input text and performing 
text analysis at Figure 2, sub-block 2; col. 3, lines 13-50, which reads on "performing analysis of 
said text segment." 
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Assigning playback rates to segments based on categorizations of a determined degree of 
importance for the text at col. 3, line 37 - col. 5, line 4, which reads on "assigning a playing rate 
indicator to each word of said text segment." 

Oikawa does not specify that the text analysis and categorizations of the determined 
degree of importance be based on grammatical analysis. However, implementation of 
grammatical analysis for speech synthesis was well known in the art. 

In a similar field of endeavor, Church teaches a method for part-of-speech determination 
and usage which implements grammatical analysis of text and identifies parts of speech 
(including nouns) of the text (Abstract), and suggests that speech synthesis needs parts-of-speech 
analysis of input text to produce a result that sounds like human speech (col. 1, lines 14-24 

Therefore, it would have been obvious to one of ordinary skill at the time of invention to 
perform grammatical analysis of text to identify parts of speech of the text, as taught by Church, 
to implement parts of speech as the categories of degrees of importance in the speech synthesis 
system of Oikawa, for the purpose of producing synthetic speech that sounds more like human 
speech, as taught by Church. 

Regarding claims 7, 21, and 27, Oikawa and Church teach everything as claimed in 
claims 6, 20, and 26. Additionally, Oikawa teaches generating synthetic speech based on the 
assigned playback rates and allows for the omission of speech for segments in which an 
indication of a slow playing rate was identified at col. 5, lines 28-37. 
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4. Claims 8-10 and 12 are rejected under 35 U.S.C. 103(a) as being unpatentable over 

Oikawa in view of Church as applied to claim 7 above, and further in view of Hutchins 
(US Patent No. 5,384,893). 
Regarding claim 8, Oikawa and Church teach everything as claimed in claim 7. Oikawa and 
Church do not specifically teach decreasing the duration of pauses associated with selected 
punctuation in a text segment. However, it is well known to modify output duration of text-to- 
speech synthetic speech to reflect various punctuation in the text of interest. 

In a similar field of endeavor, Hutchins discloses a method and apparatus for speech 
synthesis based on prosodic analysis. Specifically, at col. 7, lines 44-56, Hutchins teaches 
implementation of a prosody generator, which "adjusts duration before and stress following 
punctuation.*' 

Therefore, it would have been obvious to one of ordinary skill at the time of invention to 
modify the speech synthesis system of Oikawa to implement modifying the duration of pauses . 
associated with punctuation in a text segment, as taught by Hutchins, for the purpose of making 
the synthetic speech sound more like conversational speech and hence sound more natural. 

Regarding claim 9, Oikawa, Church and Hutchins teach everything as claimed in claim 8. 
Oikawa does not specifically disclose identifying parts of speech of the analyzed text. However, 
implementation of identification of parts of speech for text for implementation in text-to-speech 
synthesis was well known in the art. 

In a similar field of endeavor, Church teaches a method for part-of-speech determination 
and usage which implements grammatical analysis of text and identifies parts of speech 
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(including nouns) of the text (Abstract), and suggests that speech synthesis needs parts-of- speech 
analysis of input text to produce a result that sounds like human speech (col. 1, lines 14-24 

Therefore, it would have been obvious to one of ordinary skill at the time of invention to 
perform grammatical analysis of text to identify parts of speech of the text, as taught by Church, 
to implement parts of speech as the categories of degrees of importance in the speech synthesis 
system of Oikawa, for the purpose of producing synthetic speech that sounds more like human 
speech, as taught by Church. 

Regarding claim 10, Oikawa, Church and Hutchins teach everything as claimed in claim 

9. Additionally, Additionally, Oikawa teaches generating synthetic speech based on the assigned 
playback rates and allows for the omission of speech for segments in which an indication of a 
slow playing rate was identified at col. 5, lines 28-37. 

Regarding claim 12, Oikawa, Church and Hutchins teach everything as claimed in claim 

10. Additionally, Additionally, Oikawa teaches generating synthetic speech based on the 
assigned playback rates and allows for the omission of speech for segments in which an 
indication of a slow playing rate was identified at col. 5, lines 28-37. 

5. Claims 13-14, 22-23 and 28-29 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Oikawa in view of Richard et al (US Patent No. 5,924,068), hereinafter referred to 
as Richard. 

Oikawa discloses a speech synthesis apparatus for rapid speed reading. 
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Regarding claims 13, 22, and 28, Oikawa et al teaches receiving input text and 
performing text analysis at Figure 2, sub-block 2; col. 3, lines 13-50, which reads on "performing 
analysis of said text segment." 

Assigning playback rates to segments based on categorizations of a determined degree of 
importance for the text at col. 3, line 37 - col. 5, line 4, which reads on "assigning a playing rate 
indicator to each word of said text segment." 

Oikawa et al do not specify that the categorizations of the determined degree of 
importance be based on an inventory of pre-selected words. However, implementation of storing 
a user desired inventory of keywords was well known in the art. 

In a similar field of endeavor, Richard teaches a system that uses text-to-speech synthesis 
to provide audio output of user selected text which implements a dictionary to provide syntactic 
and semantic prosody and allows users to provide keywords which are used to determine what 
information is to be selected for synthesis. Users are able to determine which information is read 
and vary the rate at which the information is read (Abstract). 

Therefore, it would have been obvious to one of ordinary skill at the time of invention to 
implement pre-selected words as the categories of degrees of importance in the speech synthesis 
system of Oikawa, for the purpose of allowing system users to determine what text is synthesized 
and vary the rate at which the synthetic speech is produced, as suggested by Richard. 

Regarding claims 14, 23, and 29, Oikawa and Richard teach everything as claimed in 
claims 13, 22, and 28. Additionally, Oikawa teaches generating synthetic speech based on the 
assigned playback rates and allows for the omission of speech for segments in which an 
indication of a slow playing rate was identified at col. 5, lines 28-37. 
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6. Claims 15-17 are rejected under 35 U.S.C. 103(a) as being unpatentable over Oikawa in 
view of Richard as applied to claim 14 above, and further in view of Hutchins (US Patent 
No. 5,384,893). 

Regarding claim 15, Oikawa and Richard teach everything as claimed in claim 7. Oikawa 
and Church do not specifically teach decreasing the duration of pauses associated with selected 
punctuation in a text segment. However, it is well known to modify output duration of text-to- 
speech synthetic speech to reflect various punctuation in the text of interest. 

In a similar field of endeavor, Hutchins discloses a method and apparatus for speech 
synthesis based on prosodic analysis. Specifically, at col. 7, lines 44-56, Hutchins teaches 
implementation of a prosody generator, which "adjusts duration before and stress following 
punctuation." 

Therefore, it would have been obvious to one of ordinary skill at the time of invention to 
modify the speech synthesis system of Oikawa to implement modifying the duration of pauses 
associated with punctuation in a text segment, as taught by Hutchins, for the purpose of making 
the synthetic speech sound more like conversational speech and hence sound more natural. 

Regarding claim 16, Oikawa, Richard and Hutchins teach everything as claimed in claim 

15. Additionally, Additionally, Oikawa teaches generating synthetic speech based on the 
assigned playback rates and allows for the omission of speech for segments in which an 
indication of a slow playing rate was identified at col. 5, lines 28-37. 

Regarding claim 17, Oikawa, Richard and Hutchins teach everything as claimed in claim 

16. Additionally, Additionally, Oikawa teaches generating synthetic speech based on the 
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assigned playback rates and allows for the omission of speech for segments in which an 
indication of a slow playing rate was identified at col. 5, lines 28-37. 



Response to Arguments 
7. Applicant's arguments filed April 8, 2004 have been fully considered but they are not 
persuasive. 

Regarding claims 1,18, and 24, Applicant argues Kosaka fails to teach using the same 
syllable count to expand or reduce the VCV parameters. The Examiner disagrees and argues that 
Kosaka et al discloses using the same syllable count to expand or reduce the VCV parameters at 
col. 9, lines 39-40, as indicated in the rejection above. 

Regarding claims 6, 20, and 26, applicant argues Oikawa does not appear to disclose, 
teach or suggest assigning an importance degree value to each word in a text segment. Applicant 
is referred to Oikawa col. 3, lines 30-34, in which Oikawa teaches assigning degree of 
importance based on content of the text and to Church col. 10, lines 8-63, in which Church 
specifically describes the importance of parts of speech in speech synthesis and indicates that 
"pronunciation sometimes depends on part of speech" (col. 10, lines 15-16). Further, Church 
specifically describes the categorization or degree of importance for each word of the phrase "I 
see a bird" (col. 6, lines 9-30). 

In response to applicant's argument that there is no suggestion to combine the references, 
the examiner recognizes that obviousness can only be established by combining or modifying the 
teachings of the prior art to produce the claimed invention where there is some teaching, 
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suggestion, or motivation to do so found either in the references themselves or in the knowledge 
generally available to one of ordinary skill in the art. See In re Fine, 837 F.2d 1071, 5 
USPQ2d 1596 (Fed. Cir. 1988) and In re Jones, 958 F.2d 347, 21 USPQ2d 1941 (Fed. Cir. 
1992). In this case, Richard specifically teaches the advantages of allowing system users to 
determine what text is synthesized and vary the rate at which the synthetic speech is produced. 



Conclusion 

8. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 
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Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Angela A. Armstrong whose telephone number is 703-308-6258. 
The examiner can normally be reached on Monday-Thursday 7:30-5:00 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (703) 305-9645. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



Angela A. Armstrong 

Examiner 

Art Unit 2654 
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June 27, 2004 




